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Abstract. We analize heartbeat time-series corresponding to several 
groups of individuals (healthy, heart transplanted, with congestive heart 
failure (CHF), after myocardial infarction (MI), hypertensive), looking 
for short- and long-time statistical behaviors. In particular we study the 
persistency patterns of interbeat times and interbeat-time variations. 
Long-range correlations are revealed using an information-based tech- 
nique which makes a wise use of the available statistics. The presence 
of strong long-range time correlations seems to be a general feature for 
all subjects, with the exception of some CHF individuals. We also show 
that short time-properties detected in healthy subjects, and seen also in 
hypertensive and MI patients, and completely absent in the trasplanted, 
are characterized by a general behavior when we apply a proper coarse- 
graining procedure for time series analysis. 



1 Introduction 

In the last few years, time series analysis techniques have been applied to heart- 
beat sequences to detect long-range memory . With these methods one can 
in principle distinguish between different statistical properties that may corre- 
spond to different pathophysiological condition. Another important goal of this 
analysis is to unravel the hidden control mechanisms responsible for the heart- 
beat dynamics. Indeed heart rate variability is influenced by the "competition" 
between sympathetic and parasympathetic nervous system activity as well as by 
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non-autonomic factors. Consequently, an analysis regarding different time scales 
can in principle disentangle the different contributions to the heart rate variabil- 
ity. In this work we use a recently developed technique for measuring long-range 
correlations called Diffusion Entropy (DE) |Qj5||^. We also focus on short-range 
time behavior and, using a coarse-graining procedure, we identify an objective 
resolution time for the measure of the interbeat distance. We use this results to 
compute the relative importance of the short-time dynamics with respect to the 
long-term one. 



2 Materials and methods 

2.1 The difTusion entropy technique 

In order to process the data, the first step consists of a coarse-graining proce- 
dure on the RR time series. This strategy, shared also by other groups (see, for 
instance, Ref. ||^) makes a statistical treatment of the data possible. 

We define a coarse-graining parameter s and we obtain a new series, namely 

T,(s) = [Tjsl (1) 

where Ti is the time distance between the i*'* and the i — 1*'* beat, Ti{s) is the 
coarse-grained time series and [•] denotes the integer part. Equation (|l|) means 
that we preliminary divide the interbeat distance scale in several "boxes" of size 
s and we assign to all the interbeat distances lying in the same "box" the same 
value in the sequence Ti{s). Next, we convert this series into a dichotomous 
sequence, setting = I when Ti{s) ^ Ti„i(s), and = if Ti{s) = Ti-i{s). Fi- 
nally, we generate several trajectories (labelled with the index I) for the variable 
xi at "time" t, namely, 

i+t 

Note that, for simplicity, we have omitted indicating the dependence on s. As 
shown in Ref. if the sequence is ergodic, then the probability distribution 
of the variable function of t is expected to fit the scaling property 

with the "degree of anomaly" being measured by the distance of the scaling 
parameter S from the standard value 0.5 corresponding to Brownian motion. 
It is straightforward to prove that the Shannon entropy 

/oo 
p{x,t)\n{p{x,t))dx (4) 

of a process fitting the scaling condition of Eq. (0) yields 



S{t) ^ A + S\n{t), 



(5) 
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where A is a constant, whose exphcit form is not relevant for the ensuing discus- 
sion. This resuh is immediately obtained by plugging Eq. (||) into Eq. (^). The 
above method of evaluating the scaling parameter 6 is called Diffusion Entropy 
(DE), and, as it will become clear later, this technique is more efficient than the 
calculation of the second moment of the probability distribution. Note that when 
the distribution density under study departs from the ordinary Gaussian case 
and the function F{y) has slow tails with an inverse power law nature Q the 
second moment is a divergent quantity. This diverging quantity is made finite by 
the unavoidable statistical limitation. In this case, the second moment analysis 
would be determined by the statistical inaccuracy, thereby leading to misleading 
conclusions, while the method based on Eq. (||) yields correct results [§|J^. 



2.2 Short-time analysis 

Let us define the sequence {tj} as the distance between I's in the series 
The authors of Ref. [Q studied the case where the probability distribution for 
r decays as an inverse-power law with exponent > 2. If Tj were uncorrelated, 
then a simple expression would exists, linking /i and S, namely 

(5=^— ((5 = 0.5 if /i> 3). (6) 
fi i 

In our analysis we observe a strong discrepancy between the values of S « 0.8 
and of /i w 5, thus suggesting that the sequence of r, has a long time memory |^. 
This is why we perform the normalized auto-correlation function of the sequence 
Tj defined as 



((r,. ~(r))(r,.+, - (r))) 
(rl) - (r) 

where (■) is the arithmetic average, e.g 



1 ^ 



/c=0 

and N is the number of terms in the sum. 

At this point we realize that for most data the correlation function ^tU) 
actually decays as an inverse power law after an abrupt fall occurring at the 
transition from j = to j = 1. The value at j — 1, defined as e^, exhibits a 
peculiar behavior as a function of the coarse-graining parameter s. We will show 
that this behavior is typical of all the groups, with the exception of transplanted 
patients and some CHF individuals. 



2.3 RR time series 

The RR series analyzed are taken from the NOnLinear Time Series AnaLysIS 
(NOLTISALIS) archive. This data set is the result of the collaboration of several 
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interdisciplinary Italian research centers. The data correspond to 50 patients 
classified in 5 groups of 10 subjects each: normal, hypertensive, post-MI, with 
congestive heart failure, and heart transplanted. 

For each subject there are 24-hour Holter recordings, whose sample frequen- 
cies range from 128 to 1024 Hz, according to the type of pathology. 



3 Results 

3.1 Long-term analysis 

The analysis performed with the Diffusion Entropy technique yields very accu- 
rate fits as shown in Figs. |^ and ^. The average results for each group of patients 
is shown in table RT. 




Fig. 1. Diffusion Entropy S(t) for healthy patients. The straights lines in log- 
linear scale represent best fits for Eq. (5) 



As shown in table |l|, the values of 5 indicate very strong correlations for all 
the different groups. The CHF individuals, however, as previously shown in Ref. 
[^1^, are characterized by a somewhat smaller value. One of the main results 
of this paper is that, remarkably, the results from table |^ are independent of 
the coarse-graining parameter s. As we shall show later, this property can be 
accounted for by a simple intermittent model. 

On the other hand, the coarse-graining parameter s becomes very important 
in evaluating the correlation function ^r{j)- In fact, different values of s yield 
different series for r, while a large/small coarse graining leads to higher/lower 
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Fig. 2. Diffusion Entropy S(t) for patients hypertensive. The straights hnes rep- 
resent best fits for Eq. (5) 



Table 1. Average value of 6 for each group of patients 



Group 


(<5> 


healthy 
CHF 

post-MI 
transplanted 
hypertensive 


0.83 ± 0.02 

0.80 ±0.05 
0.85 ±0.04 
0.86 ±0.08 
0.87 ±0.02 
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value for (r). We investigate the different correlation functions stemming from 
different coarse-graining values s. As earlier stated, in most cases the correlation 
function presents an abrupt drop going from j = to j = 1. After this fast drop, 
it decays to zero very slowly, like an inverse-power law. In fact, the data are not 
clean enough to establish the inverse power law index with a good accuracy. This 
means that the DE is a much more efficient method to detect memory and much 
less ambiguous than the correlation function, and we can actually describe the 
meaning of S making use of a general class of dynamical models. The correlation 
function 'PtU), on the other hand, depends on the details of the model and does 
not afford an easy way to derive it. 

The numerical form of the correlation function ^t(j) is 

<?r(j) = (l-e')<5,,o + e'C(j). (9) 

Here Sj,o denotes the Kronecker 6 step, while the function C{j) is a smooth 
function with the property C(0) = 1. We account for the long-term properties, 
namely second term in the r. h. s of (^), as follows. We hypothesize that the 
heartbeat rate, on a rather long time scale, undergoes trends of acceleration and 
deceleration of slopes a, whose duration length r are asymptotically distributed 
with an inverse power law, with index /i'. We suppose that the consecutive trends 
are not correlated and consequenlty the scaling parameter S is connected to fi' 
viai 

Here we sketch a proof of Eq. (p^. First of all we define the distribution prob- 
ability for the slope a, f{a). We make the simplifying and plausible hypothesis 
that this distribution has no infinite moments. This restriction does not apply to 
the T-distribution, which has, on the contrary slow tails and diverging moments. 
Without loss of generality we assume the following form 

^{r) = r, (11) 

where A = (t) /{^' — 2). During one single trend of acceleration or deceleration 
the position of the walker xi (t) is displaced by a quantity 

. = ^r, (12) 

s 

which is the number of times the walker crosses the "box boundaries" defined by 
the coarse-graining procedure on Ti . Consequently, we can write the probability 
of a displacement z that reads 

oc oc 

p{z) ^ j j Aa5 (z - ^) V(r)/(a). (13) 

-oo 

After many trends it is straightforward to see, e. g. along the lines of Ref. 
that the movement of our walker is essentially equivalent to a process where a 
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walker, at regular time distances of duration (r), walks by a quantity Ax ~ z, 
distributed as 

Since the function ip has a long tail, this latter process, according to the Gen- 
eralized Central Limit Theorem ||^, results in a Levy Flight, which is a stable 
process with diverging central moments. The difficulty of dealing directly with 
the distribution ( p^ is due to the fact that a is itself a stochastic variable. 
However we can detect the asymptotic scaling property of the walker xi{t) by 
studying the first diverging moment of the distribution p{z). To find this value 
we evaluate 

oc 

(z'^) ^ [ p{z)z^Az. (15) 



The smallest value of 7 for which this integral diverges, called 7, defines the 
scaling 5 via (5 = I/7 Plugging Eq. ( p^ ) into (p^, we obtain 

oc 00 00 

(z^) ^ j dz j At j da ,5 (z - ^) V(r)/(a)z''. (16) 

-oc 

Integrating over z, this result can be written as 

(Z^) = ( / ^(T)T^dT I • F„ (17) 



where 

= / da ( ^ ) /(a). (18) 



lal ^ 



This means that only the first factor might make (r^) divergent, thereby imply- 
ing that 7 = /i' — 1 and proving Eq. ( p^ . This also proves that 8 does not depend 
either on the coarse- graining parameter s or on the distribution of a, which in 
fact contribute only to Fg. 

We now establish an interesting connection with the two-walkers model of 
Ref. |6| as follows. We assume that the trajectory Ti is a zig-zag path. Each 
straight line of this path is determined by the slope a^-, which connects the 
persistency time Tj to the coarse-grain parameter s, by 

a, = -. (19) 

If the straight line portion of the zig-zag curve has length Ij , we have about IjUj / s 
identical values of Tj . These are the pseudoevents of the two- walkers model of 
Ref. i. 
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3.2 Short-term analysis 

Of course this crude model can only describe the long-time regulation of the 
heartbeat dynamics and has to be supplemented with middle- and short-time 
corrections so as to take into account the abrupt fall corresponding to the first 
term in the r.h.s. of Eq. (^). Indeed short-term control mechanisms are due to 
the activity of the autonomic nervous system which presents a very short time 
scale. 



0.18 




0.02 

10 100 

s (msec) 



Fig. 3. Short-time correlation parameter as a function of the coarse-graining 
resolution s for all the the healthy subjects. Two of them are plotted with a 
much higher number of points, to show both the subtle erratic dependence and 
the "continuous" one. 



We use a short-time (local) perturbation model which does not destroy long- 
time properties, in a way that is similar to the point-like mutations affecting 
the long-range correlations in DNA sequences The value of does in fact 
depend on the value of s and therefore we compute e^(s) in order to investigate 
the properties of these control mechanisms. Surprisingly enough, the function 
e^(s) show a quite universal feature for all healthy and hypertensive individuals: 
we observe a steep increase as s goes from few msec up to Smax — 30 msec where 
we have a maximum. After that we have a slow decrease until 200 msec. The 
most striking result is that, even if the maximum value is quite different among 
the various individuals, the values of Smax seems to be universal, the variability 
among different patients being lower than 5 msec. This result is shown in Fig.^. 
This feature is somewhat visible (always for Smax = 30 msec) also in post- 
Mi or hypertensive patients (see Fig. ^) and even in some CHF ones, while it 
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is completely absent in some CHF patients and in all the heart transplanted. 
Results for these latter groups are shown in Fig. 0. 



0.18 




' ' 1 

10 100 
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Fig. 4. Short-time correlation parameter as a function of the coarse-graining 
resolution s for 5 hypertensive patients and 5 post-MI patients. We plot only 
half of these groups for plot clarity. 



To the best of our knowledge, the interpretation of Smax is not easily ex- 
plained in physiological terms. With the due caution, we may intepret this scale 
as an intrinsic resolution for the assessment of the heartbeat rate from the con- 
trol system. In other words there should be a natural coarse graining in the 
physiological determination of the interbeat time distance. The fact that this 
behavior is missing in the case of heart transplanted patients seems to imply 
the action of the autonomic system. Anyway, the measured timescales rule out 
the possibility that this maximum is due to the sampling rate of the measuring 
apparatus. We conclude that more work has to be done to explain this effect 
throughfuUy. Finally, from a methodological point of view we use this property 
to evaluate in an objective way a value for the parameter e^, which is a mea- 
sure of the regularity of the interbeat time accelerations a during the trends of 
variation. 

At this point, as done in Ref. we can plot the value of 5, independent 
of our coarse graining, as a function of our objectively defined value of e^. The 
result is shown in Fig. We recall that both parameters reflect memory, the 
former being associated with long-range fractal properties, and the latter with 
short-time persistency in the heart acceleration patterns. This latter behavior 
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10 100 
s (msec) 

Fig. 5. Short-time correlation parameter as a function of the coarse-graining 
resohition s for all 10 transplanted patients. The structure of the previous figures 
is absent. 
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Fig. 6. Bidimensional plot of i5(e^) for all the 50 subjects. 



is certainly connected with multifractal properties, as reported e.g. in Ref. ||l 
Arguably, the proposed treatment here proposed is more intuitive. 
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Fig. 6 shows the {e^, S} scatter plot for all 50 subjects. Even though there is 
no satisfactory statistical discrimination among groups, maybe due to a variety 
of factors like the limited number of patients, differences in illness seriousness 
and differences in therapy, we observe that healthy individuals are concentrated 
in a small portion of the graph, in agreement with the results of Rcf. Q. Hy- 
pertensive patients, too, have their peculiar region, characterized by an higher 
value of S. Other pathologies seem to occupy broader regions. A significant por- 
tion of pathological individuals occupy the "healthy zone" , and these may be 
associated to less serious conditions. Actually, preliminary results (unpublished) 
on an enlarged group of CHF patients seem to suggest a correlation between 
serious condition and the absence/distorsion of the e^(s) peak at 30 msec. These 
patients are also characterized by a smaller value of e^(s) and they are therefore 
mostly outside the "healthy" zone. 

4 Conclusions 

In conclusion, this study allows us to propose, on the basis of a simple statistical 
analysis, a series of conjectures to be accurately tested by further studies. The 
physiological long-range memory, probably due to a cascade of mechanisms at 
different timescales, can be characterized by two parameters, the scaling S of the 
underlying infinite- memory process, and the weight e^, which gives information 
of the relative importance of this latter process versus short-time mechanisms. 
The presence of the 30 milliseconds peak of Fig. which allows an objective 
definition of seems to be physiological and corresponds to the presence of a 
fast control mechanism, which has to react in a time shorter than few heartbeats 
to properly explain the abrupt fall of the correlation functions. This control 
mechanism seems to act with a special time resolution: a "grid" of about 30 
milliseconds. The absence, distorsion, or shift of this internal resolution may 
reflect a pathological condition and is often associated with a lower degree of 
long-time memory. 

Finally, we stress that the coarse-graining procedure adopted herein yields re- 
sults that are independent of the coarse-graining parameter, making this analysis 
robust. We believe that this analysis can be applied with success to a variety of 
time series with both long- and short-time structures, and even to nonstationary 
signals. Results stemming form our procedure may thereby provide additional 
information useful in many diagnostic and prognostic problems. In general, we 
envisage our procedure as an automatic classifier for those complex sequences 
that are not easily dealt with, using usual statistical learning methods. Appli- 
cations may range from natural-language semantic classifiers based on discourse 
dynamics to predictors of catastrophic events |^ . 
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